Testing Gricean Constraints on a WordNet-based Coherence Evaluation System
Abstract
This paper presents a computational method for analyzing Gricean constraints for the purpose of evaluating text coherence. Our system consists of a knowledge base constructed on top of WordNet, an inference engine that establishes discourse semantic paths between the input concepts, and a mechanism for relating them to the context. Grice's maxims provide conditions to test coherence, while the semantic paths provide the space on which these conditions are tested. The computational method is based on a marker-propagation technique that is independent of the size of the knowledge base. The paper describes the method and provides results obtained with the system.

The key idea of this paper is to link Grice's maxims for conversational logic with high-level inferences drawn from a linguistic knowledge base. We discuss how the Gricean maxims may be tested in a real natural language processing system and how the information extracted plays a role in the analysis of context, dialogue coherence, and intentions.

A text is considered to be coherent when it is logically connected and intelligible. The logical connections are semantic relationships between concepts. We are experimenting with a text understanding system that operates on a large linguistic knowledge base with concepts and relations between concepts. Inference rules are constructed by chaining semantic relations in the knowledge base. A few of these rules lead to deductions, but many provide only plausible inferences. Semantic paths between concepts may be established as patterns of inference rules. The information along such paths provides an explanation for the relationship between two concepts. In this paper, we link these semantic paths to Gricean constraints. The concepts and the relations that form the semantic paths provide context and information about text coherence. The Gricean constraints are first tested by analyzing the paths, and this provides information to validate or filter out irrelevant inferences.
Our path-finding algorithm uses a marker-propagation paradigm that is highly parallel and asynchronous. Unlike other approaches that use marker passing (Charniak 1986; Norvig 1987), we use propagation rules that guide the movement of markers along selected relations. Any marker present on a node may further propagate along semantic relations that (a) generalize, (b) define the features of, or (c) entail or are entailed by the properties of the concept represented by that node. Whenever two markers originating from different nodes of the knowledge base reach the same node, a collision of the …
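As an illustration only, the marker-propagation idea can be sketched in a few lines of Python. This is a sequential, breadth-first approximation, not the parallel and asynchronous algorithm the paper describes. The toy knowledge base, the relation names (`is_a`, `gloss`, `entails`, `part_of`), and the functions `build_graph`, `propagate`, and `find_collisions` are all hypothetical and are not taken from the paper or from WordNet's API.

```python
from collections import deque

# Hypothetical toy knowledge base of (source, relation, target) triples.
# A real system would draw these from WordNet; the entries here are made up.
TOY_KB = [
    ("dog", "is_a", "canine"),
    ("canine", "is_a", "animal"),
    ("cat", "is_a", "feline"),
    ("feline", "is_a", "animal"),
    ("dog", "part_of", "pack"),
]

# Assumed propagation rule: markers travel only along relations that
# generalize ("is_a"), define features ("gloss"), or entail ("entails").
ALLOWED_RELATIONS = {"is_a", "gloss", "entails"}


def build_graph(triples):
    """Index the triples as node -> list of (relation, target) edges."""
    graph = {}
    for source, relation, target in triples:
        graph.setdefault(source, []).append((relation, target))
        graph.setdefault(target, [])
    return graph


def propagate(graph, origin, allowed):
    """Spread a marker from origin along allowed relations.

    Returns a dict mapping each reached node to the list of
    (relation, node) steps that carried the marker there.
    """
    paths = {origin: []}
    queue = deque([origin])
    while queue:
        node = queue.popleft()
        for relation, target in graph[node]:
            if relation in allowed and target not in paths:
                paths[target] = paths[node] + [(relation, target)]
                queue.append(target)
    return paths


def find_collisions(graph, origin_a, origin_b, allowed=ALLOWED_RELATIONS):
    """Return nodes reached by markers from both origins, with both paths."""
    marked_a = propagate(graph, origin_a, allowed)
    marked_b = propagate(graph, origin_b, allowed)
    return {
        node: (marked_a[node], marked_b[node])
        for node in marked_a
        if node in marked_b
    }


graph = build_graph(TOY_KB)
collisions = find_collisions(graph, "dog", "cat")
for node, (path_a, path_b) in collisions.items():
    print(node, path_a, path_b)
```

In this sketch, the two half-paths recorded at a collision node together form one candidate semantic path between the two input concepts. The `part_of` edge is ignored because it is not in the assumed set of allowed relations, which mirrors the idea of guiding markers along selected relations only.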